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Abstract 

We prove the strong converse for the A^-source Gaussian multiple access channel (MAC). In particular, we 
show that any rate tuple that can be supported by a sequence of codes with asymptotic average error probability 
less than one must lie in the Cover-Wyner capacity region. Our proof consists of the following. First, we perform 
an expurgation step to convert any given sequence of codes with asymptotic average error probability less than 
one to codes with asymptotic maximal error probability less than one. Second, we quantize the input alphabets 
with an appropriately chosen resolution. Upon quantization, we apply the wringing technique (by Ahlswede) on the 
quantized inputs to obtain further subcodes from the subcodes obtained in the expurgation step so that the resultant 
correlations among the symbols transmitted by the different sources vanish as the blocklength grows. Finally, we 
derive upper bounds on achievable sum-rates of the subcodes in terms of the type-II error of a binary hypothesis 
test. These upper bounds are then simplified through judicious choices of auxiliary output distributions. Our strong 
converse result carries over to the Gaussian interference channel under strong interference as long as the sum of the 
two asymptotic average error probabilities less than one. 

Index Terms 

Gaussian multiple access channel. Strong converse, Binary hypothesis testing. Expurgation, Wringing technique 


I. Introduction 

The multiple access channel (MAC) is one of the most well-studied problems in network information theory [1]. 
The capacity region of the discrete memoryless MAC was independently derived hy Ahlswede [2] and Liao [3] 
in the early 1970s. In this paper, we are interested in the Gaussian version of this problem for which the channel 
output Y corresponding to the inputs (Vi, Y 2 ,... ,Xn) is 

N 

y = Y,x, + z, (1) 

i=l 

where Z is standard Gaussian noise. We assume an average transmission power constraint of Pi corresponding to 
each transmitter iG{l,2,...,iV}. The capacity region was derived by Cover [4] and Wyner [5] and is the set of 
all rate tuples {Ri, R 2 , ■. ■, Rn) G that satisfy 

i£T ^ ieT ' 

for all subsets TC{l,2,...,At}. For the N = 2 case, the pentagonal region of rate tuples in (2) is known as the 
Cover-Wyner region and is illustrated in Figure 1. 

Despite our seemingly complete understanding of fundamental limits of the Gaussian MAC, it is worth high¬ 
lighting that in the above-mentioned seminal works [2]-[5], it is assumed that the average error probability tends to 
zero as the length of the code grows without bound. This implies that those established converses are, in fact, weak 
converses. Fano’s inequality [1, Sec. 2.1] is typically used as a key tool to establish such weak converses. In this 
work, we strengthen the results of Cover [4] and Wyner [5] and show that any rate tuple that can be supported by a 
sequence (in the blocklength) of Gaussian multiple access codes with asymptotic average error probability strictly 
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Fig. 1 . Capacity region of the two-encoder Gaussian MAC [ 4 ], [ 5 ]. We use the shorthands Ci = ^ log(l + Pi) and C112 = | log(l -I- 
Pi /(1 + P2)) and similarly for C2 and C'211. 


less than one (and not necessarily tending to zero) must lie in the Cover-Wyner region. This is a strong converse 
statement, akin to the work on strong converses for point-to-point channels hy Wolfowitz [6]. It indicates that 
the boundary of the Cover-Wyner region designates a sharp phase transition of the smallest achievable asymptotic 
error probability, which is zero for any rate tuple inside the capacity region and one for any rate tuple outside the 
capacity region. Thus, this work augments our understanding of the first-order fundamental limit of the Gaussian 
MAC. Additionally, it may also serve as a stepping stone for studying the second-order asymptotics [7]-[10] or 
upper bounds (e.g., the sphere-packing bound) on the reliability function of the Gaussian MAC (cf. [11, Th. 4]). 

A. Related Work 

The study of MACs has a long history and we refer the reader to the excellent exposition in El Gamal and Kim [1, 
Ch. 4] for a thorough discussion. Dueck [12] proved the strong converse for the (two-source) discrete memory less 
MAC using the the technique of blowing up decoding sets originally due to Ahlswede, Gacs and Komer [13], 
combined with a novel strategy known as the wringing technique. The technique of blowing up decoding sets uses 
the so-called blowing-up lemma [13], [14] (see also [15, Ch. 5] or [16, Sec. 3.6]). This technique is useful for 
establishing strong converse results for memoryless channels with finite output alphabets. 

Dueck’s proof proceeds in three steps. First, Dueck expurgates an appropriate subset of codeword pairs to convert 
any given sequence of codes with asymptotic average error probability bounded away from one to a sequence of 
codes with asymptotic maximal error probability bounded away from one.* This expurgation step is performed so 
that the blowing-up lemma to be applied in the third step yields tight upper bounds on the sum-rate, which will then 
lead to the desired strong converse result. Unfortunately, the expurgation step introduces undesirable correlations 
among the codewords transmitted by the N encoders. Second, a wringing technique is introduced to wring out any 
residual dependence among the symbols transmitted by the N encoders by choosing a further subcode from each 
subcode obtained in the expurgation step. Wringing is necessary for establishing a tight sum-rate bound, because 
the sum-rate capacity of the MAC is expressed as the supremum of mutual information terms over all independent 
input distributions (the independence is due to the fact that the N encoders do not cooperate). Third, the blowing-up 
lemma is applied to the resultant subcode to yield a tight upper bound on the sum-rate. 

Ahlswede [17] presented another strong converse proof for the (two-source) discrete memory less MAC by 
modifying Dueck’s wringing technique as well as replacing the use of the blowing-up lemma in Dueck’s proof with 
an application of Augustin’s non-asymptotic converse bound [18]. However, the proofs of Dueck and Ahlswede 
are specific to the discrete (finite alphabet) setting and it is not clear by examining the proofs that the same strong 
converse statement follows in a straightforward way for the Gaussian MAC with peak power constraints. 

Another approach to proving the strong converse for a general MAC is due to Han [19], who used the information 
spectrum technique [20] to provide a general formula for MACs and stated a condition [19, Th. 6] for the strong 
converse to hold. However, unlike for the point-to-point setting [20, Sec. 3.6-3.7], the property is difficult to verify 
for various classes of memoryless MACs. 

'Although the capacity region of the Gaussian MAC is well-known when it is defined in terms of the average error probability [ 1 ], [ 4 ], 
[ 5 ], the determination of the capacity region is an open problem if it is defined in terms of the maximal error probability. 
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In view of the above works and the practical and theoretical importance of strong converse theorems, we are 
motivated to provide a self-contained proof for the strong converse of the Gaussian MAC. 


B. Challenges in Establishing the Strong Converse and Our Strategies to Overcome Them 

In this subsection, we discuss the challenges of leveraging existing techniques to prove the strong converse for 
the Gaussian MAC. In particular, we highlight the difficulties in directly using the ideas contained in Dueck’s [12] 
and Ahlswede’s [17] proofs. We also describe, at a high level, the strategy we employ to overcome these difficulties. 
Finally, we discuss some other auxiliary proof techniques. 

1) Blowing-Up Lemma in Dueck’s Proof [12] Cannot be Directly Extended to Continuous Alphabets: In Dueck’s 
paper [12], he used a version of the blowing-up lemma, together with other tools, to prove the strong converse 
theorem for the discrete memoryless MAC. A crucial step in Dueck’s proof involves the establishing of an upper 
bound on the list size of possible messages for every output sequence based on the blown-up decoding sets. If the 
resultant list size is too large (e.g., contains an exponential number of messages), the Dueck’s technique cannot 
lead to the strong converse theorem. Since this crucial step heavily relies on the finiteness of the output alphabet 
and the output alphabet of the Gaussian MAC is uncountably infinite, it is not immediately apparent how to extend 
this step to the Gaussian case. 

2) Wringing Technique in Ahlswede’s Proof [17] Cannot be Directly Extended to Continuous Alphabets: As 
mentioned in the previous section, Ahlswede’s proof [17] is based on a modification of Dueck’s wringing technique 
and Augustin’s non-asymptotic converse bound [18]. However, it is not apparent how to adapt his techniques to 
obtain a strong converse bound on the sum-rate. More specifically, Ahlswede’s wringing technique (see Equation 
(5.3) in [17]) leads to the following sum-rate bound for any sequence of length-n codes whose asymptotic average 
error probability is bounded away from one: 


Ri + R2<I{XuX2]Y) + 0 


( 1 ^) 


(3) 


In (3), Xi and X 2 are independent random variables. However, the bound in (3) is sensitive to the sizes of the 
input and output alphabets, which prevents us from directly extending Ahlswede’s proof to the Gaussian case. 
Furthermore, there are no cost constraints in the discrete memoryless MAC and incorporating cost constraints does 
not seem to be trivial. A naive strategy to extend Ahlswede’s proof to the Gaussian case is to quantize the input 
and output alphabets of the Gaussian MAC so that Xi, X 2 and y depend on n and their cardinalities grow with n. 
Say we denote the quantized alphabets as xj^\ x!^^ and This sequence of quantized alphabets and the 
corresponding channels will be designed to provide increasingly refined approximafions fo fhe Gaussian MAC as n 
increases. In designing X^^\ X^^^ and y^’^f we would also like fo ensure fhaf fhe power consfrainfs are satisfied 
and fhe ferm O \xi'^'^\\X^'^'^\\y^'^'>\ in (3) vanishes as n fends fo infinify. However, quanfizafion argumenfs 

fhaf are used fo prove informafion-fheorefic sfafemenfs for confinuous-valued alphabefs are usually applied fo fhe 
achievabilify parfs of coding fheorems. For example, a quantization argumenf is used in [1, Sec. 3.4.1] for leveraging 
fhe achievabilify proof for fhe discrefe memoryless channel (DMC) wifh cosf consfrainfs fo prove fhe achievabilify 
parf of fhe capacify of fhe AWGN channel. To fhe besf of our knowledge, sfandard quantization argumenfs for 
achievabilify parfs do not work for strong converse proofs because upon quantization, one has to ensure that the 
resultant asymptotic error probability is bounded away from one. 

The reader is also referred to [21, Appendix D.6] for a complementary explanation of why Ahlswede’s original 
wringing technique works for only MAGs with finite alphabets but not the Gaussian MAC. 

3) Remedy — Combining a Quantization Argument with the Wringing Technique: The difficulties in directly 
using Dueck’s and Ahlswede’s techniques led the authors to combine a novel quantization argument together with 
Ahlswede’s wringing idea. We use a scalar quantizer of increasing precision in the blocklength to discretize (only) 
the input alphabets of the channel so that the Ahlswede’s wringing technique can be performed on the quantized 
channel inputs for any given code whose asymptotic error probability is bounded away from one. In doing so, we 
obtain a sequence of subcodes whose asymptotic error probability is bounded away from one such that the resultant 
correlations among the codeword symbols transmitted by the different sources vanish as n increases. Note that if 
the quantizer’s precision is too small or too large, the resultant upper bound on the sum-rate will be too loose 
and hence not useful in proving the strong converse. We discuss feasible choices of the quantizer’s precision and 
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the parameters used in the wringing technique in Section V-H. In our proof, the quantizer’s precision is chosen in 
such a way that the quantized input alphabets grow no faster than It turns out that this choice of 

quantization also allows us to control the approximation errors between the true channel inputs and the quantized 
ones uniformly. 

4) Other Ingredients in Our Proof: In Ahlswede’s proof of the strong converse for the discrete memory less 
MAC, he appealed to a non-asymptotic converse bound by Augustin [18]. In our proof we use a conceptually 
similar non-asymptotic converse bound that is motivated by modern techniques relating binary hypothesis testing 
to channel coding. In particular, we use a form of the meta-converse [22, Sec. III-E] due to Wang, Colbeck and 
Renner [23, Lemma 1]. We derive a multi-user version of this non-asymptotic converse bound. After doing so, 
we choose the auxiliary conditional output distributions therein to be product distributions that approximate the 
quantized code distribution. We note that the flexibility of the choice of the output distributions is essential for 
proving the strong converse for the Gaussian MAC as we can allow these distributions to depend not only on the 
peak powers but also the chosen precision of the scalar quantizer (cf. Section I-B3). 


C. Paper Outline 

In the next subsection, we state the notation used in this paper. In Section 11, we describe the system model and 
define the e-capacity region of the Gaussian MAC. In Section III, we present the main result of the paper. We present 
a few preliminaries for the proof in Section IV. The complete proof is then presented in Section V. Section VI 
extends our strong converse result to the two-source two-destination Gaussian IC under strong interference. 


D. Notation 

We use the upper case letter X to denote an arbitrary (discrete or continuous) random variable with alphabet X, 
and use a lower case letter x to denote a realization of X. We use X'^ to denote the random tuple (Ai, X 2 ,..., A„). 

The following notations are used for any arbitrary random variables X and Y and any mapping g whose domain 
includes X. We let px,Y and Py\x denote the probability distribution of (A, V) (can be both discrete, both 
continuous or one discrete and one continuous) and the conditional probability distribution of Y given A respectively. 
We let px,Y{x,y) and PY\x{y\x) be the evaluations of px,Y and Py\x respectively at (A, A) = {x,y). To avoid 
confusion, we do not write Pr{A = x,Y = y} to represent px,y{x, y) unless A and Y are both discrete. To make 
the dependence on the distribution explicit, we let Pip^{g{X) G A] denote pxix)l{g(x) G A} dx for any 
real-valued function g and any set A. The expectation and the variance of g{X) are denoted as Yp^[g{X)] and 
Xa.Yp^[g{X)] = Ep^[(( 7 (A) — Yp^[g{X)\f‘] respectively, where we again make the dependence on the underlying 
distribution px explicit. We let Af(• : M —)• [0,oo) denote the probability density function of a Gaussian 

random variable whose mean and variance are p, and respectively. This means that 

Tib 

We will take all logarithms to base 2 throughout this paper. The Euclidean norm of a vector x” G M” is denoted 

by lk”ll = 


II. Gaussian Multiple Access Channel 
We consider a Gaussian MAC that consists of N sources and one destination. Let 

X^{1,2,...,A} (5) 

be the index set of the sources (or encoders), and let d denote the destination (or decoder). The N message sources 
transmit information to the destination in n time slots (channel uses) as follows. Eor each z G X, node i chooses 
message 


IV, G{l,2,...,iW,("b 


(6) 
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in) 

and sends Wi to node d where denotes the message size. Based on Wi, each node i prepares a codeword 


X2 S M” to he transmitted and should satisfy 

n 

k=l 

where Pi denotes the power constraint for the codeword transmitted hy node i. Then for each k G {1,2,. 
each node i transmits Xi^k in time slot k and node d receives the real-valued symbol 

Xk = Xi^k + ^k, (7) 

i£l 

where Zi,Z2,...,Zn are i.i.d. and Zi is a standard Gaussian random variable. After n time slots, node d 
declares {Wi}i^x to be the transmitted {Wi]i^x based on 

To simplify notation, we use the following convention for any T PI. For any random tuple (Xi,X 2 , ■ ■ ■ ,Xn), 
we let 

XT = iXi\iG T) (8) 

be its subtuple, whose generic realization and alphabet are denoted by xt and 

= (9) 

i£T 

respectively. Similarly, for any A: G {1, 2,... , n} and any random tuple (Xi fc, ^ 2 ^^,... ,X]\[^k) G Aj, we let 

XT,k = {Xi^k I i e T) (10) 


be its subtuple, whose realization is denoted by xx^k- The following five definitions formally define a Gaussian 
MAC and its capacity region. 

Definition 1: Let T be a non-empty subset in I. An {n, M^\ Px,A,T)-code for the Gaussian MAC, where 
..., and Px = {Pi,P 2 , ■ ■ ■, Pn), consists of the following: 

1) A message set Wi = {1,2,..., at node i for each i G X. 

2) A support set of the message tuple Wx denoted by ^ C Wx where Wx is uniform on A. In addition, all the 
tcx’s in A have the same wt<^, i.e., there exists a wfia G >Vr-= such that for all wx G A, we have wx^ = 
Define 

Ax — {wx G Wx I There exists a wx G A such that wx = wx} (11) 

to be the support of Wx- Consequently, the message tuple Wx is uniform on Ax- 

3) An encoding function fi : >Vi —>• M” for each i G X, where fi is the encoding function at node i such that 
Xf = ffiWi) and 

mwi)f < nP, ( 12 ) 

for all Wi G Wi. The set of codewords {ffil), fi{2 ),... , is called the codebook for Wi. For each 

z G X, the finite alphabet 

Ai = {x G M |x is a component of ffiwi) for some Wi G Wi} (13) 

is called the support of symbols transmitted by i because ffiWi) P Xfi. Note that 

lA’il < nMf”) (14) 

for each i G X by (13). 

4) A (possibly stochastic) decoding function p : M” —>• A, which is used by node d to estimate the message 
tuple Wx, i.e., Wx = p{W^). 

If .A = Wx and T = X, then Wx is uniformly distributed on Wx, which implies that the N messages are mutually 
independent. Since (n, Px, VV’i,X)-codes are of our main interest, they are also called (n, , Pj)-codes 

for notational convenience. However, in the present work, it is necessary to allow A and T to be strict subsets of 
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Wx and X respectively so the generality afforded in the above definition is necessary. In this case, the N messages 
need not he independent. In the rest of this paper, if we fix a code with encoding functions {fi\i € X}, then Xi as 
defined in (13) denotes the support of symbols transmitted by each z G X. 


Definition 2: A Gaussian MAC is characterized by the conditional probability density function QyiXx satisfying 

(15) 




for all xx G and all y G M such that the following holds for any {n, M^\ Px, A,T)-code: Let be 

the probability distribution induced by the (n, , Px,A, T)-code. Then, 


PW:,,Xi,Y»{wx,Xx,y'^) =PWx{wx) WpYk\Xx,Syk\xX,k) 


(16) 


\i&X 


\k=l 


for all {wx, y^) G ,4. X Afj x M” where 

PYk\Xx,kiyk\xX,k) - <lY\Xx{yk\xx,k)- (IV) 

Since PYk\Xx k depend on k by (17) and (15), the channel is stationary. 

For any {n, M^\ Px,A,T)-code defined on fhe Gaussian MAC, let P]y^x"Y'^Wx distribution 

induced by the code. Since Wx is a function of by Definition 1, it follows that 


which implies from (16) that 


PWx,Xi,Y”-,Wx “ FW'i,X",Y"F^^|yn, 


PWx,Xi,Y^,Wx —PWx,Xi ( WPYk\Xx,k I PWx\Y'' 


\k=l 


(18) 


(19) 


Definition 3: For an {n, M^\ Px)-code defined on the Gaussian MAC, we can calculate according to (19) the 
average probability of decoding error which is defined as 

Ft{WxAWx}. (20) 

An (n, M^\Px)-code wifh average probabilify of decoding error no larger fhan e is called an (n,A4"\Px,e)avg- 
code. Similarly for an (n, ,Px,A, r)-code, we can calculate the maximal probability of decoding error defined 
as 

max Prj Wt A Wt \Wx = wx]■ (21) 

Wx&A ^ ^ 


An (n, ,Px,A, r)-code wifh maximal probabilify of decoding error no larger fhan e is called an {n,M^^\Px, 
A, T, e)i„ax-code. 

Definition 4: A rafe fuple Rx = {Ri, R 2 , ■ ■ ■ , Rn) is s-achievable for fhe Gaussian MAC if fhere exisfs a 
sequence of {n, M^\ Px, Sn)avg-codes on fhe Gaussian MAC such fhaf 

lim inf — log > Ri (22) 

n—foc Tl ^ 


for each z G X and 


lim sup En < £■ 

n^oQ 


(23) 


Definition 5: For each e G [0,1), fhe e-capacity region of fhe Gaussian MAC, denofed by Cg, is fhe set consisting 
of all e-achievable rate tuples Rx- The capacity region is defined fo be fhe 0-capacity region Cq. 
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III. Main Result 

The following theorem is the main result in this paper. 

Theorem 1: Define 

7^cw = n ^ I Ri < ^ log (l + EisT Pi) } ■ (24) 

TCX 

Then for each £ G [0,1), 

Ce C T^cw- (25) 


We now present three remarks concerning Theorem 1. 

1) Note that TZcw is the Cover-Wyner [4], [5] region for an A^-source Gaussian MAC. The theorem says that 
regardless of the admissible average error prohahility (as long as it is strictly smaller than 1), all achievable 
rate tuples must lie in TZcw- Since all rate tuples in TZqvj are 0-achievable [1, Sec. 4.7], we have for every 

e G [0,1) 

Ce — T^cw- (26) 


2) In fact, the proof allows us to additionally assert the following: For any non-vanishing average error probability 
£ G [0,1) and any subset T C X, it can be shown that the sum-rate of the messages indexed by T of any 
sequence of (n,Px,e„)avg-codes satisfying the constraint in (23) also satisfies 


lim sup —j^=^= 
n^oo y/nlogn 


log 

A&T 


?log 1 + 


iST 


< T{£,T,Px) < OO 


(27) 


for some finite consfant T(e, T, Pj). See (179) in fhe proof of Theorem 1. Even fhough fhe normalizing 
speed of y/nlo^ is nol fhe desired y/n (as usually defined in second-order asympfofic analyses [7]), fhe 
techniques in fhis work may serve as a stepping stone fo esfablish an outer bound for fhe second-order coding 
rate region [7] for fhe Gaussian MAC. The besf inner bound for fhe second-order coding rates for fhe Gaussian 
MAC was esfablished independenfly by Scarleff, Martinez, and Guillen i Fabregas [8] and MolavianJazi and 
Laneman [9]. According fo fhe inner bounds in [8], [9] and fhe relafion befween second-order coding rates 
and second-order asympfofics of sum-rales in [10], 


lim inf —^ 

n->-oo 


'Y log 

-ieT 


n 


log 


i + E« 

i&T 


> X(e, T, Px) > -OO 


(28) 


for some finite constant T(e,T, Pj). Our normalizing speed of y/n log n in (27) is slightly better than in 
Ahlswede’s work on the discrete memoryless MAC [17], which is yTilogn. We have attempted to optimize 
(reduce) the exponent of the logarithm ^ > 0 in the normalizing speed y/n{logn)‘^. However, as we will 
discuss in Section V-H in the sequel, we are unable to use our proof technique to further reduce (improve) 
C from 1/2. For both the discrete and Gaussian MAGs, it is challenging to prove that the exact normalizing 
speed of the second-order term is y/n. This is, in part, due to the use of wringing technique in the converse 
part, which prevents one from obtaining a converse that matches the achievability in the rate of growth of 
the second-order term. Unless new techniques are invented to replace the wringing argument in the strong 
converse proof for the MAC (such techniques have remained elusive for over 30 years), the exact normalizing 
speed of the second-order term for the discrete and Gaussian MAGs will remain an open problem. 

In the next section, we will present a few preliminaries for the proof of Theorem 1, which will be detailed in 

Section V. 


IV. Preliminaries eor the Proof of Theorem I 
A. Expurgation of Message Tuples 

The following lemma is based on the technique of expurgating message tuples introduced by Dueck [12, Sec. II], 
and the proof is provided in the Appendix for completeness. 










Lemma 1: Let e G [0,1). Suppose an {n, M^\ Px,s)avg-code for the Gaussian MAC is given. Then for each 
nonempty T PI such that 


1 — e 
1 + £ 


there exist a set ^ C Wj and an (n, M^\Px, A, T, i^)^^^-code such that 


i&T 


(n) 


> 


1 — £ 

2(1 + e)) I 


IJM^, 


(29) 


i&T 


\At\ = 1^1 > 


1 — £ 

2(1 +£) ) ^ 


IfAf;"’, 


(30) 


i&T 


where At is as defined in (11). As a consequence, if we let PvVx X" y** tTx denote the prohahility distribution 
induced on the Gaussian MAC hy the {^n,M^\Px,A,T, i^)^^^-code, then we have for each wt G At 


PWt{wt) < 


1 



(^) 


(31) 


Remark 1: Lemma 1 says that restricted to the set At, the i* (for i & T) codehooks have almost the same sizes 
as the original codehooks. In addition, the conditional prohahility of decoding error for each message tuple in this 
restricted codehook is upper hounded hy which is still smaller than one because e G [0,1). According to (31), 

the probability of each message tuple cannot be greater than its original value by a factor of ( ). 


B. Wringing Technique 

The following lemma forms part of the wringing technique proposed by Ahlswede and its proof can be found 
in [17, Lemma 4]. 

Lemma 2: Let A” be a finite alphabet, let px" and ttx" be two probability mass functions defined on and 
let c > 0 be a real number such that 

PX" {x^) < (1 + c)ux" (x") (32) 


for all x” G Fix any 0 < A < 1. Then for any 0 < <5 < c, there exist I natural numbers in {1, 2,... , n}, 
denoted hy ti,t 2 , ■ ■ ■ ,ti, and I elements of X denoted by xt^, xt^, ■ ■ ■, x ^, such that the following three statements 
hold: 

(I) ^ < f ■ 

(II) Prp^„ { [Xi -^, Xi ^, • • •, Xi ^) = (xt\, Xt2 , • • •, Xti )} A A . 

(III) For all A: G {1, 2,... , n} \ • • •, ti}, we have 

FXfc|Xti,Xt2,...,Xt^ {xk\xt^,xt2, ■ ■ -Att) 

< vnax{{l + 5)ux^\x,^^x,2,-,x,^i.^k\xt„xt2,...,xt,),\} (33) 

for all Xk G X. 

The crux of Lemma 2 is in the identification of the event 

A' = {iXt,,Xt^,... ,Xt,) = {xt,,xt2,... ,xt,)} (34) 

such that conditioned on P, the distributions of the resultant codeword symbols transmitted in each time slot k 
can be approximated by ux^ (cf- (33)). In the sequel where each X^ in Lemma 2 is substituted by XT,k where 
XT,k is some quantized version of XT^k to be specified later, the joint distribution that approximates Px^^ 

will be chosen to be a product distribution (cf. (52)) with marginals In order to use Lemma 2 for proving 

Theorem 1, an important step involves controlling the size of X in Lemma 2. To this end, we use the following 
scalar quantizer to quantize the alphabet A) (in (13)) which is exponential in the blocklength n (cf. (14)) so that 
its quantized version is an alphabet whose size is polynomial in the blocklength. 

Definition 6: Let L be a natural number and A be a positive real number, and let 

^L,A = {-iA,(-L + l)A,...,LA} 


(35) 










be a set of 2L + 1 quantization points where A specifies the quantization precision. A scalar quantizer with domain 
[—LA, LA] and precision A is the mapping 


such that 


^L,A • [—LA, LA] —> 

[x/AJ A if X > 0, 




[x/ A] A otherwise. 


(36) 


(37) 


In other words, maps x to the closest quantized point whose value is smaller than or equal to x if x > 0, 

and to the closest quantized point whose value is larger than or equal to x if x < 0. In addition, define the scalar 
quantizer for a real-valued tuple as 

(38) 


such that 


0j;i:[-LA,LA]-^Z2,^ 

- (^L,A(a;i), • • • > ^LAi^n))- 


(39) 


By our careful choice of the quantizer in Definition 6, we have the following property for all x £ M: 

lo ifx>0, 

^ |-[x/A]A otherwise 

f [x/AJ A if X > 0, 

1 [—x/AJ A otherwise 

= Lkl/^J^ 

< jxj. 


(40) 

(41) 

(42) 

(43) 


Although the following lemma looks similar to [17, Corollary 2] and they both rely on Lemma 2, the proof of the 
following lemma is more involved due to the additional consideration of the quantizer’s precision and the quantized 
input symbols. If the quantizer’s precision is too small or too large, then the resultant bound obtained from the 
following lemma will not be useful in proving the strong converse. See Section V-H for a detailed discussion on 
the appropriate choice for the quantizer’s precision. 

Lemma 3: Suppose we are given an {n, M^\ Px, A! ,T, 2^)^^^-code such that 

1 — e 




(n) 


and 


PwA'^t) < 


riieT 


(n) 


2(1 + e) 

1 — e 


(44) 


(45) 


for each wt £ A'x where p' 


{n,M^\Px,A',T, l^)^^-code. Then, there exists an (n, Px,A,T, l^)^^-code with 




denotes the probability distribution induced on the Gaussian MAC by the 


(n) 


-4|T|{l + 3e) /—/ 1 — P 

l^rl = l-A] > n a-) ‘ 




(n) 


2 ( 1+^);.41 • 


(46) 


such that the following holds: Let Py^^ denote the probability distribution induced on the Gaussian MAC 

by the (n, M^\ Pj, A, T, l^)^^-code. In addition, let 

X? = 




define the alphabet 


Xi = Z 


|”n ,n~ 


(47) 

(48) 
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for each i gT (Xp is always in the domain of Z? , hecause of (47), (43) and (12), and hence Xp e XP), 

* \n\/nFi\ ,n~^ ' ' i i ' 

define 


XT = X{Xi 

i&T 


(49) 


and define 




i&T '' 


(50) 


for all {wx-, xp, x^, y”, wx) ^ Ax Xp x Xp x x A. Then there exists a distribution defined on Xp where 

3|T| 


\XT\<n^ll{2,/P, + 3) 

i&T 


(51) 


such that for all A: G {1, 2,... , re}, we have 


for all XT k ^ and 


Px^S^Xk) < max<^ 1 + 


i£T k=l 


log re 


re 


n ^ 




ieT 


n^m 


X, 


i.k 


< '^nPi. 
iGT 


(52) 


(53) 


Before presenting the proof of Lemma 3, we would like to stress the following two important implications of 
Lemma 3. 


(i) By identifying a certain event 

Q = { {XT,ti, XT,t2 , ■ ■ • ! XT,tt ) = {xT,ti At, t2, ■■■ , XT,ti )} 


(54) 


(whose prohahility is quantified in (70) in the following proof), we can find a subcode such that for each time 
slot k, the resultant probability distribution of the quantized vector of transmitted symbols XT,k = I * S 
T) can be approximated by a product distribution HieT^jv ^ essence of the wringing 

technique [12], [17] which involves approximating the joint distribution of the random variables corresponding 
to the different encoders with a product distribution. By approximating XT,k with a product distribution, we 
effectively wring out the dependence among the collection of random variables {Xj ^ | i G Tj. 

(ii) The alphabet size of the quantized transmitted symbol XT,k grows no faster than polynomially in re as in (51). 
Our quantization strategy that results in the polynomial growth of the alphabet sizes of the quantized symbols 
appears to be an important and necessary step, because the original alphabet size \Xt\ could be exponentially 
large in re (cf. (14)). Furthermore, the controlled growth of \Xt\ ensures that Pr{^} does not decay to zero 
exponentially fast as shown in (70) in the following proof and hence the asymptotic rates of the resultant 
subcode are the same as that of the original code. An important point to note here is the following: We are 
able to lower bound the probability Pr{^} because we defined Q in terms of the quantized random variables 
(rather than the original ones). The application of the wringing technique on the quantized random variables 
is one of the major contributions of the present work. 

Proof of Lemma 3: Let p'^ ^ be the probability distribution induced on the Gaussian MAC by the 

[n, Px, A' ,T, i^)^^^-code that satisfies (44) and (45), and let 


Pw,,XS,X^,Yr^,W^^^^ ^ 5 , y", wx) 


A f 

~ PWx,X^,Y' 


^^^{wx,xx,y ,-u;i)JJl|xj ) 

ieT k 


( 55 ) 
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Define a probability mass function u' - as 

Wt 


U 


Wt,x:},x: 


XWT,X^,X^)^11 




i&T 


M 


{n) 


(56) 


for all {wt, x^, x^) € Wt x ^ (^f- (^5) and (48)), where fi represents tbe encoding function for Wi of the 

(n, Px, r, i^)^^^-code (cf. Definition 1). The distribution u'^ x" X" well-defined (the probability 
masses sum to one) through (56) because 


E 




^w^,Xf,Xf(^T,x^,x^) 


(56) 


IVIa rr.n ^\>n A/-'T _ _ At-'T' 


idt^VVt i i^T 

= 1 . 


cs.eT" *sT 


‘ n\/nPi ,n“i ' ‘ ^ 


Using (56), we obtain 


Wt,X^,X^ 


n< 


Wi,xr,x" 


i&T 


where 


u. 


'Wi,xp,x; 


{wi,x'X,x'X) = 


M. 


(n) 


l{a;r = fiiwi)} • 1 = D 


'[nUrfR'l, 




(57) 

(58) 

(59) 

(60) 

(61) 


for all {wi,xf,x'l) £ Wi x X TT. We will use Lemma 2 to prove the existence of a subcode of the 
[n, M^\ Px,A' ,T, 7^)^^^-code such that the subcode satisfies (46), (52) and (53) for some defined on X^. 
To this end, we first consider the following chain of inequalities for each S X^ such that p'~ (x^) > 0: 


l‘Wr.X,.XpT,XT,XT) 


P'xpT) = E 

W'j' 

E Pw.Mp'x^,x^\wJ^tAtM 

W'J' Gi»4.^ 


(a) 


E AvrM n (l {X" = hM] ■ 1 {i? = o'"’ „_.(x“)}) 


(62) 

(63) 

(64) 


(45) 

< 


E 


1 


tOrS^yiXySXy rTigT 

( 56 ) 2(1 + e) 


(n) 


' i^) n (i w=/*(».)}■ 1 {*?=(X? 

/ \ r I I 


1 — s 


E 


'W'T 


^'wr,X-,X^^^T:X^,XT) 


< 


^(1 + g) 
\ — £ 


E 


iut ^ VVt 




where (a) follows from (16) and (55). It follows from (68) and Lemma 2 with the identifications 




^ 1 + Se 
c = 


I — £ 


A- ' 


n 


4|T| ’ 


<5 = 


log re 


re 


(65) 

( 66 ) 

(67) 

( 68 ) 

(69) 


that there exist £ natural numbers in {1,2,..., re}, denoted by fi, t 2 j ■■■,££, and £ real-valued |r|-dimensional tuples 
in Xt, denoted by XT,ti,XT,t 2 ^ ■ ■ ■ AT,te, such that the following three statements hold: 
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(I) I < (^) 


(II) Pr, 


- V/ V 

'in { ^^T,U > -^T,t2 > • • • > ) = (ST,ti, ®T,t2 , • • • , XT,t, ) I > • 


(70) 


(III) For all A: G {1, 2,... , n} \ {ti, i 2 , • • • ,te}, we have 


P'x lit X X ixT,k\^T,ti,XT,t2, ■ ■ ■ ,XT,ti) 


< max < 1 + 


= max < 1 + 


log n \ , 


n 


(®T,fc I a;T,ti, XT,t2 ! ■ ■ ■! ), 


logn 


n 


ieT 


(71) 

(72) 


for all XT,k S <^r- 

Using Statement (II), Statement (III) and (44), we can construct an (n, M^\Px, A, T, i^)^^^-code by collecting 
all the codewords for the {n, M^\ Px,A',T, P^)^^^-code which satisfy 

; ^T’,t2 ? • • • ; ^T,ti ) ; ^T,t2 ? • • • ? ^T,ti ) 

such that the following two statements hold: 

(i) MtI = Ml > n-*m‘ (j^) 

(11) Let y„ denote the probability distribution induced on the Gaussian MAC by the {n,Mi^\Px,A,T, 

22 ^) -code, and let 

2, / max 

x,X5,y^tLx(^^’^S,y">x) n 1 • 


- Pwi 


(74) 


Then, 


P'p.f h{U.i„= St,,„} =!, 

lm=l j 

and we have for all A: G {1, 2,..., n} \ {Ai,A 2 ! ■ ■ ■ Ai\ 

Px^ ,ixT,k) < max<^ 1 + 


logn 


n 


n^' 


i&T 


k I t p —t p , ±2 —, t2 i • • • 


(Xi,k 




(75) 


(76) 


for all XT,k S <n- 
Since for each A: G {U, t 2 ) ■ ■ ■ j 


Px^ ^{xT,k) 1 {xT,k = XT,k} = n “■ 


iST 


'Xi,k\Xi.t.,=Xi,t^,Xi,t2=Si,t2,-,Xi,te=Xi,tf. 


(77) 


for all xx^k S tTt, it follows from (76) that the following statement holds: 
(ill) For all A; G {1, 2,... , n}, we have 


Px^,S^T,k) < max<^ 1 + 


logn 


n 


n^' 

ieT 


'Xi^lc\Xi^ti =Xi^ti >Xi,t2 =Xi,t2 =Xi^t 


[Xi,k) 


A\T\ 


n 


(78) 


for all XT,k S Xt- 

Consequently, (46) follows from Statement (1) and Statement (I), and (52) follows from Statement (ill) by letting 


n -yn - 

y\. rj-t 


nn^; 


k=li^T 


‘Xi^k I Ai.tj =a:i,ti ,Xi,t2 =Xi,t2,---,Xi,ti =Xi 


(79) 
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It remains to prove the upper bounds on \Xt\ and ELi E. 


prove (51), we consider 




n (2 


+1 


i&T 


< [] (2n3/2/^ + 3 

i&T 

<n^ J](2v^ + 3). 


in (51) and (53) respectively. To 

(80) 

(81) 

(82) 


i&T 


To prove (53), we first use (56) and (12) to obtain 


Px,,*. EE^.>sE"« = 1 - 

(ieT k=l i&T ) 

Since for alH G T and all A: G {1, 2,... , n} by (55) and (43), it follows from (83) that 


Consequently, 


p--“ijEETi!.<E"«| = i' 

KieT k=l i&T ) 


EEe- 




i,k 


i£T k=l 


(79) 


(60) 


X] XI 

2ET k=l 


■^i.k 


EEe. 

iGT k=l 
n 

=eep. 

ieT k=l 

= E„ 


^T,k\^T,ti—^T,ti'^T,t2 ~^T,t2 ' ■ ■ •'^T ,t£—^T ,t£ 


X 


i,k 


’^T,t2 -^T,t2 .. 


^i,k 


XJh\XT+A —^T.tA ’^7 


EE4 

,igT k=l 


(84) 

< 22 • 

i&T 


(83) 


(84) 


(85) 

( 86 ) 

(87) 

( 88 ) 

(89) 


C. Binary Hypothesis Testing 

The following definition concerning the non-asymptotic fundamental limits of a simple binary hypothesis test is 
standard. See for example [22, Sec. III-E]. 

Definition 7: Let px and qx be two probability distributions on some common alphabet X. Let 

Q({0, l}|Tf) = {rz\x I Z and X assume values in {0,1} and X respectively} 

be the set of randomized binary hypothesis tests between px and qx where {Z = 0} indicates the test chooses 
qx, and let <5 G [0,1] be a real number. The minimum type-II error in a simple binary hypothesis test between px 
and qx with type-I error no larger than 1 — (5 is defined as 

f3s{px\\qx) = inf / rz\xiMx)qx{x)dx. (90) 

rzixGg({0,l}IX): 

S^^x'^z\x{l\x)px{x) dx>S 
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The existence of a minimizing test rz\x is guaranteed by the Neyman-Pearson lemma. 

We state in the following lemma and proposition some important properties of which are crucial 

for the proof of Theorem 1. The proof of the following lemma can be found in, for example, the paper by Wang, 
Colbeck, and Renner [23, Lemma 1]. 

Lemma 4: Let px and qx be two probability distributions on some alphabet X, and let p be a function whose 
domain contains X. Then, the following two statements hold: 

1. Data processing inequality (DPI): 

l3s{px\\qx) < l3s{Pg{x)hgix))- (91) 

2. For all ^ > 0, 

Afeikx) > 1 («- a 

The proof of the following proposition is similar to Lemma 3 in [23] and therefore omitted. 

Proposition 5: Let puy be a probability distribution defined on 14’ x >V for some finite alphabet W. In addition, 
let qv be a distribution defined on W, and let 

a = maxPrfy u\U = u} (93) 

u&W 

be a real number in [0,1) where ([/, 1^) is distributed according to pu,v- Then for each u € W, 

Pi-a{pv\u=u\\Qv) < qv{u). (94) 


V. Proof of Theorem 1 
A. Expurgation to Obtain a Maximum Error Code 

Let e E [0,1) and suppose Rx is an e-achievable rate tuple. By Definition 4, there exists a 7 E [0,1) and a 
sequence of (n, Px, e„)avg-codes such that 

en<l (95) 


for all sufficiently large n and 

lim inf — log > Pj 

n—¥oo Tl ^ 

for each i Fix a non-empty set T CZ. Our goal is to prove that 

< ^log + 

iST ^ ieT 

Since (97) holds trivially if ~ 9, we assume without loss of generality that 


(96) 


(97) 


It follows from (96) and (98) that 


Y,Ri>0. 

i&T 


(98) 



> 


1 /1 — 7 


2 Vl + 7 


11 ". 

ieT 


(n) 


(99) 


for all sufficiently large n. Fix a sufficiently large n and the corresponding (n, Px, en)avg-code for the 
Gaussian MAC such that (95) and (99) hold. Using Lemma 1, Lemma 3 and Definition 1, there exists an 
(n, Px, A., T, ^^)^^j^-code, which induces a probability distribution on the Gaussian MAC denoted by 
PWx X" y™ Wx’ following four statements hold: 

(i) For all wx ^ A and all wx E At, 


PWxiwi) = and pwA^r) = 


( 100 ) 








15 


(ii) There exists a w^c € such that for all wx S A, we have wt<^ = 

(iii) The support of Wt satisfies 


-4|T|(1+3t) r^n— / 1 — 7 

\At\ = |-4.| > n V log^ 


2 ( 1 + 7 ) 


If", 

i&T 


(n) 


( 101 ) 


(iv) Define 




iST 


for all {wx, x^, xl^, y"', wx) & A x T’j? x x R” x A, where 

Xt = Yl^lnV^],r. 

and 


ier 


I^tI <n—J](2/^ + 3). 

i&T 

Then there exists a distrihution defined on such fhaf for all /c S {1, 2,... , n}, we have 


( 102 ) 


(103) 


(104) 


Px^^{xT,k) <max<j I 1 + 


logn 


n 


MT\ \ 

ieT ) 


(105) 


for all xx,k £ and 




Y‘2 

Xih 


< 


^nPi. 


(106) 


Note that x" Y" tTx distrihution induced hy the original (n, Pj, en)avg-code hut rather it is 

induced hy the expurgated (n,M^\Px,A,T, i^)^^^-code. 


B. Lower Bounding the Error Probability using Binary Hypothesis Testing 
Now, let 

A 


XI SYclXxc.fc 1 (107) 

k=l 

he a distrihution such that for each A: G {1, 2,... , n}, the auxiliary conditional output distrihution is chosen to he 


'®Wz,XJ,Y",Wx — PWj:,Xy ^11 ®Y=|XT<=,fc j PWx\Y^ 


SYk\XT<^,Ayk\xT-,k) =A( , [Xj^k] + ^ xj^kA + 


(108) 


i&T 


i6T<= 


i&T 


for all X 7 ’<=,fe £ and yk G R. It can he seen from (107) and (108) that y„ depends on the choice 

of T we fixed af the start of the proof and the distrihution Uj^„ in Statement (iv). We shall see later that this choice 

'^lYx X" Y" vVx’ 1 *^ particular the mean of the distrihution in (108) namely [Ai,k] + ^j,k^ 

combined with Proposition 5 and Lemma 4 enables us to prove (97). We do not index y„ by T nor 

for notational brevity. To simplify notation, let 7 = (1 + 7)/2 be the maximal probability of decoding error 
of the {^n, M^\ Px, A,T, l-|4)^^^^-code, where 7 < 1 because 7 < 1. Then for each wx G A, since 

/ 7 (107) , , (100) 

swt[wx) = pwi\wx) > 0 , 


( 109 ) 
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it follows from Proposition 5 and Definition 1 with the identifications U = Wt, V = Wt, Pu,v = P^t Wt\Wt- 
= ^Wt\Wtc=w^c and a = Pr{iyT / wt\Wi = m} < 7 that 

A-7 {PWt\Wx=Wx 11 I Wtc =Wt<^ ) 

< l^l-a{PwT\Wx = Wx\\^WT\WT^=WT'^) 

^ SWx\Wr^^'^T\wT^)- 


C. Using the DPI to Introduce the Channel Inputs and Output 
Consider the following chain of inequalities for each wx S A: 


Pi-yip 


'y\yWx\Wx=Wxll^Wx\WT‘==WT<^ ^ 


(a) 


> /3i- 7 (Py n I Wx=wx 11 '^Y^,Wx\Wtc =wt<= ) 

= /3i- 7(py"I Wx=wxPWxIY’‘,Wx=Wx \\^Y”-,Wx\=W t‘= > 


(b) 


— f^l-'r{PY”-\Wx='WxPWx\Y^ W^Y",Wx\Wt‘^=Wt<^') 

(c) / \ 

— /5i- 7 (^Piy2;|ynPxj,y"|Wx=«)i Px?|x?e,iyx=-u'i'®j\:;5i„,y",iyx|w'TC=«;T<= j 


(107) / 

— pi-7 I P^x|y"^^z>t^"|tt^3:='^ 


a ( 

— Pl_7 I P5y^|ynPX5,y’*|Wx=Ulz 

= /3i- 7 ^PH7|y"P>’f5,’t'"|tyi=«)r 

/3i-7 ( PXi\Wx='WxPW:,\Y^ Y\.PYk\Xx.k 
\ k=l 


PX^\X^,,Wx=WxPX:}.,\Wtc=WtcPWx\Y- n ^YiclXx 

k=l 

n \ 

Px:}\x:}c,Wx=wxPx:^,\Wx=wxPwx\Y^ 11 

n 

PX"\Wx=WxPWx\Y’<- n ^y>c\XTC,k 


k=l 


k=l 


PXi\Wx=wxPWx\Y^ n ^Yk\XTC,k ] > 

k=l / 


where 

(a) follows from the DPI of hy introducing the channel output Y'^. 
(h) follows from the fact that 

Wi^Y^ ^ Wx 

forms a Markov chain under the distribution p^^ y„ 

(c) follows from the DPI of hy introducing the channel input X^. 

(d) follows from Definition 1, which says is a function of Wyc. 

D. Relaxation via Chebyshev’s Inequality 
Following (119), we consider 

n 

(19) 


' TT 

Pxj,y",tyx|Wi=-u;i “ Px£|tyz=-u)iPryx|t"" 


k=l 


and we obtain from Lemma 4 and (121) that for each wx & A and each ^^x > 0> 


/3i-7 [pXi\Wx=WxPWx\Y^ Y\.PYk\Xx,k 
1 


k=l 


PX^\Wx=WxPWx:\Y'^ n '®L=|Xxc 


k=l 




PY,,\Xx,k0^k\Xx,k) ^ ^ 


c ='Wt^ ’ 

( 110 ) 

( 111 ) 

( 112 ) 

(113) 

(114) 

(115) 

(116) 

(117) 

(118) 

(119) 

( 120 ) 


( 121 ) 


( 122 ) 
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Combining (111), (119) and (122), we obtain for each wx G A and each > 0 


1 


which implies that 
log 


{wt\wt'=) > 7 — 1 — 7 — Pr, 


e 


Wx 


PX^,Y-r^\W.j^ = Wj^ 


■TT PY^\X:,,t.{^k\Xx,k) \\ 


^WT\Wr^ {wt\wT‘) y 
< logCu;^ - log 1 - 7 - Pr 


PX5,'r"|Wx = 'mx 


log 


,fc=l 


PYk\X:o,k0^k\Xx,k) 


For each wx G A, let 


l0g?t«i - E; 


"Wx -^Px",V'"|Wx = u 


log 


Lfc=l 


.■®mxTc,fc(Efc|2fr7fc) 
PmXx,fc(Efc|9fx,fc) 


> log 6 


Wx 


SY,,\XTC,t(Xk\XT-,k) 


+ 




1 _ ;y^^'^PX^,Y<^lWx = n,x 


XI log 


.k=l 


PY^\Xx,k0^k\Xx,k) 
^Yk\XT'^,k (E/c 


Using Chebyshev’s inequality, it follows from (125) that for each wx G A 


TD-p 

^ ^PX^,YT^\Wj^ = ivj_ 

which implies from (124) that 

log 


Xlog 

^ k=l 


PYk\Xx,k0^k\Xx,k) 

'®mxT^fc(Efc|9^T^fc) 


> log^«,x \ < 


1-7 




< log + log 


1-7 


Since 1 1 —)• log j is convex for f > 0, by Jensen’s inequality 


X Ptyi('«^x)log 

Wx&A 




{wT\wT-)j ( 


We have 


X PWxMSWx\WxcMwt^)^'= Tjr X ^WxlWrcMwT^) 

Wx&A Wx&A 

WtGA-T 


T.wxeA PWx iwi)Sw^lWxc i'^T I WT-^ 

( 100 ) 1 
“ ^ 

(^) J_ 

■ 1-41 


wtGWt 

1 

■R 

where (a) follows from the definition of At in (11) and the fact stated in Statement (ii) that wt‘= = w^. 
Wx G A. Using (128) and (132), we obtain 


X PWx{wx)log 

Wx&A 


1 


> log 1^1. 


SWx\Wxci^T\wT^ 

Taking expectation with respect to on both sides of (127) and applying (133), we obtain 


log 1^1 < f ^ PWi('«^x)log^«,x ) +log(T^) 

\wx&A ) \ "i J 


(123) 

(124) 

(125) 

(126) 

(127) 

(128) 

(129) 

(130) 

(131) 

(132) 
for all 

(133) 


( 134 ) 
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E. Simplification of Log-Likelihood Terms 

In order to simplify (134), we will simplify the log-likelihood term in log^^^ defined in (125). To this end, we 
first let x2{wi) = ffwi) is the encoding function at node i defined in Definition 1) and we also let Xi^k{wi) 
denote the component of x2{wi) for each i S X and each A: € {1, 2,... n} such that 


In addition, we let 


and we let 


x2iWi) = {Xip{Wi),Xi^2{Wi), . . . ,Xi^n(.Wi)). 

XX,k{wx) = {xi,k{wi),X2,kiw2), • • • , XN,k{wN)), 

XT-,k{wT-) = (Xj^kiWj) \j € T'") 


he a suhtuple of xx,k{wx)- Similarly, let 

Xiiwx) = {x2{wi),x2{w2),... ,x%{wn)), 

and let 

xUwT^)^{x]{Wj)\jGT^) 


(135) 

(136) 

(137) 

(138) 

(139) 


he a suhtuple of xf{wx)- Using the fact that Xf is a function of Wi for alH € X and the notations defined above, 
we obtain from (125) that 




log 


.k=l 


PY^\X:,,>,{Yk\xx,k{wx)) 

SYk\XTc,k0^k\xT‘^,k{wT'=)) 


+ 


\ l-'y 


Var 




log 


.k=l 


Pn|Xx..(^fckx,fc('Wx)) 


which implies from (16) that 

log^ijjz — 


XI log 


.k=l 


SYk\XT<^,k{^k\xT-,k{wTfi) 
PYu\X^,A^k\xx,k{wx)) 


SYk\XTC,k0^k\xT-,k{wT-)) 


+ 


^1-7 


Var 


nfc=i p5^fci-^x,fc='^x,fc(“i) 


Xlog 


.k=l 


PY^\X^,Ayk\xx,k{wx)) 


which then implies that 


logCt«x = 


l^x.fc ~^X,fc (^x) 


k=l 


+ 


log 


SYk\X-r<^,k{yk\xT-,k{wT-)) 

PYk\X:,,kiyk\xx,k{wx)) 


^Yk\XT<^,kiyk\xT-,k{'^T-)) 




ITT E Var; 


l^x,fc=^x,fc(^x) 


k=l 


log 


PYk\x^,SYk\xx,k{wx)) 

SYk\X-r<^,k{yk\xT‘^,k{wT-)) 


Following (142), we use (17), (15) and (108) to obtain 

PYk\X:,,k{yk\xx,k{wx)) 


log 


•Sni Xxc, (^fc krsfe {WT -)) 


-log 11 + Y, Pi] + 


log e 


ieT 


2(1 + Y^ierPi) 


X “ X ^i,ki'^i) 


KieT 


iex 


T2 j 'S ^ ixi^kiwi) \^i,k'\) ] ( Ufc ''fi ^ X^^ki'^i) ] 3" ( Y~\iXi,ki'iiii) [^i,k]) ] 

VieT /V i&X J VieT ’’ / 


(140) 


(141) 


(142) 


( 143 ) 
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For each wx & A and each k £ {1,2,..., n}, it follows from Definition 2 that — Yhi&x ^i,k{wi) is a standard 
normal random variable if is distributed according to PYk\Xi k=xx k{wx)^ which then implies that 


E, 


‘P'^k\^X,k=^X,k('^z) 


log 


(143) 1 
2 


PYk\Xx.k0^k\xx,k{wx)) 

SYk\XTC,k(Xk\xT-,k{'>^T'=)) 

log e 


7^2(1+ Ei,T-f'. 


Pj I “h I y ^y iXi,k{'>Xi) ^ [Xj /j]) 

\i&T J \i£T ' / 


and 


Var 


'^A: I ^ X, fc — ^ X, fc (^ Z ) 


log 


PYk\Xx,k0^k\xx,k{wx)) 

SYk\XTC,k0^k\xT-,k{wT-)) 


(143) 


log e 


Var- 




i&T 


\2(1 + 

~k 2 I y ^y Xj^ki'U^i) k ^ y y Xi^ki'^i) 


Xi,k{Wi) 


iex 


\ieT 


iex 


Define 


(SieT-^*) ~k \^i£Ti^i,k{Wi) En^j. ^ [^i,fe]) j ) (log c) 
2(1 + Z^ieT 


\PT\=^P^ 

i&T 


and 


Xijki'UJi') — Xi^kipXi) 

Combining (134), (142), (144), (146), (147) and (148), we obfain for each wx £ A 

T.wx&APWxiwx) (-nlPrl + ELi {EiexHkiwi))'^^ loge 


n , 


log|,A| < -log(l + \Pt\) + 


+ 


_ 2(1 + |Et|) 

T.wxeAPWx(wx) y(n|PTP + 2ELi {^iex log e 


+ log 


1-7 


(1 + |Pt|)v/1 - 7 

which implies from Jensen’s inequality (t <-A y/i is concave for f > 0) that 

-n\PT\ + ELi Y.wxeAPWxiwx) (E ))E loge 


n 


log 1^1 < -log(l + \Pt\) + 


+ 


2(1 + |Pt|) 

^n|PTp + 2 ELi T.n,xeAPWx{wx) (Eier^*,fc(w^*))^loge 
(1 + IPtDa/I - 7 


+ log 


1-7 


(144) 


(145) 


(146) 


(147) 


(148) 


(149) 


( 150 ) 


In the following, we will obtain an upper bound on the crucial term J2k=i'^wx&APWxi'‘Xx) (Eier 
which appears in the second and third terms on the right-hand-side of (150). 
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F. Introducing the Quantized Input Distribution to Simplify the Upper Bound 
Following (150), we consider for each k £ {1,2,... ,n} 

PwAwx) '^Xi^ki'Wi) 


WxGA 


vieT 


I I Y PWr-lWri'^T^WT) 

xiST / WT'^^y^T'^ 


Y PWt{wt) 


Wt^-A^T 


sieT 




(151) 

(152) 

(153) 


Wt^Wt 


sieT 


Since Xf is a function of Wi for each i G T, it follows from (148) that for each k € (1,2,... ,n} 


Y PWt{wt) ( 

\ieT 


wt^Wt 

which implies from (153) that 


k{xT,k) - J , (154) 


XT,k£^T 


sieT 


Y PwAwi) Y^bkiWi) < Y PXT,AxT,k) Y i^kk - , lYk. 


Wx&A 


(155) 


vieT 


XT,k^^T 


sieT 


Recalling the definition of Xf and Xf in (102) and (103) respectively, we write for each k £ {1,2,... ,n} 


Y Y (^kk - ^ [Xi 


XT,k^^T 


siGT 


Y PXx.k,XT.S^Fk,XT,k) 

,k^^T i^T,k^^T 

Yh PXT.k,XT,S^'^Fl^T,k) 

^T,k^^T ■,XT,k^^T 


E 

\i&T 


Xi,k + Xi^k ^ \Xi^k 


(156) 


^ Xi,k') 

\i&T 


+ 2 ^ ^ FxT.fcjXr.fc ^ ^, (.Xi,k Xik) \ f ^ ^^ {Xi,k ,^[Xi,k]) 

XT,k&XT,XT,k&XT ' Xicyr / 

3" 'Y^ ~'^u^_^[Xi,k\) 1 


(157) 


^T,k^^T 


\iGT 


^T,k^^T ■,XT,k^^T 


^ ^i,kj 

iGT 


+ 2 Y, T'Xr.fc.Xr.fc^^Efc) ^'r,fc) 

XT,k^^TiXT,k^^T 




iGT 




iGT 


+ Y^ Px.„ S^T,k) ( 'YX^iyk Eu^_^[Xj^fc]) 


(158) 


^T,k^^T 


siGT 
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XT,k£^T ■,XT,k£^T 


^ ^ \^i,k ^i,k\ 

\i£T 


+ 2 E Px^,k,X^,S^T,k,XT,k){Yl 


X'i ^ Xi ^ I 


^T,k^^T AT,k^^T 


\ieT 


+ ^Ux. ^ [\^'l 

\i&T 


i,k\ 


+ PXt.S^'^’^^ ( ^ux. ^[^i,k]) 


&T,k^^T 

~ m? ^Jn 


\ieT 


(EVflj+ E [ 'y '^ux. ^[^i,k]) 


(159) 

(160) 


XT.k&^T 


kieT 


where (a) follows from the facts helow for each i ^ T, each k £ {1,2,... ,n} and each £ Xi (recall the 
definition of Xi^k in (102)): 


and 


(37) 1 

\^i,k ®i,fc E 

n 


(43) (12) - 

|^'i,/!;| ^ ^ y'klP-i . 


(161) 


(162) 


G. Approximating the Quantized Input Distribution by a Product Distribution 

In order to bound the last term in (160), we use the bound in (105) for bounding Px^ ^{xT,k) in terms of 
Ux^^{xT,k) to obtain 


Px^ S^T',k) ( yy(^i,fc Etijf. ^ [-^i,fc]) 

XT.k^^T \i^T / 


r 

^T,k^^T ' ' 


E. 

XT,kG^T ^ 


1 + 


logn 


n 


logn 


n 


n + ^ 4 |t| j j (163) 


teT 


VIST 


J. 1. ^Xi k ( y y ^i,k ^ux^ i^i^iyk]') 


i&T 


kieT 


.^4|T| ^ux.^[^i,k]) 


kieT 


(164) 


for each k £ {1,2,... ,n}. The bound in (164) consists of two distinct terms which we now bound separately. 
Consider the following two chains of inequalities for each A:£{l,2,...,n}: 


E (u^x 

&T,ke^T 


{ii,k) 




=ee. 

ieT 


{d^i,k j, 


[Xi,k]f 


<Ee. 

i&T 



(165) 


( 166 ) 
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and 


^T,k&^T ^ 


- yz (^1^1 jXi_fc]||^ 

XT,k€^^T 

XT,k^^T 

<\T\" E 


^T,k^^T 


(a) 


<2|T|2 E(^?.A: + (E.^,,fc])‘ 

^T,k^^T 

( 162 ) 

< 2|r|2 5] 5]2nP, 


XT,k&XT 


(b) 


<4n|rhPT||^r| 


T 4n3|^l|r|2|PT| n( 2 /^ + 3), 

i&T 

where 

(a) follows from the fact that (a — 6)^ < 2a^ + 26^ for all real numbers a and b. 
(h) follows from the definition of \Pt\ in (147). 

Combining (164), (166) and (173), we obtain for each fe € {1, 2,... , n} 

2 


yy Px^ s^T,k) I y~~](^i,fc ^ [Xj^fc]) I 

. u(^X^ \i&T / 

+ 4n-l^l|r|2|PTin(2/^ + 3), 


XT,k^^T 




ier 


ieT 


which implies from (155) and (160) that 

E pwziwx) I y^xi,fc(w'i) I 

WxGA \i&T / 

'' VieT / \ / i&T i&T 

Using (175) and (53) and recalling that |r| > 1 (because T is non-empty), we obtain 


logn 


n 


Ee. 


^i,k 


E E pwxiwx) f 

k=l wxGA \i€T 




< nlPxl + ^/nlognlPxl + 4Vn|r| IE uij + '‘1^1 vti n(2/fi+3)+ 


ieT 


|r|^ 


n 


To simplify notation, let 


A 

Kl = 


4|r| and K2 = 4|T|2|PTin(2V^ + 3) 

VieT / 


(167) 

(168) 

(169) 

(170) 

(171) 

(172) 

(173) 


(174) 


+ 4n-l^l|r|2|Prin(2V^ + 3). (175) 


(176) 


iST 


( 177 ) 
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be two constants that are independent of n. Then, we combine (150) and (176) to yield 

[y/n \ogn\PT\ + y/uKi + K 2 + n~^\T\‘^) loge 


n 


log 1^1 < -log(l + |Pr|) + 


+ 


2(1 + |Pt|) 

\/n\PT\{\PT\ + 2) + 2y/nlogn\PT\ + 2y/nKi + 2k 2 + 2n-^\T\^ loge 

(1 + |-Pt|)a/1 - 7 


+ log 


1-7 


(178) 


Combining (101) and (178), we obtain 


-4|^|(l + 37)^ r-^ — ,1 

-;-Z- v/re log n + log 

1-7 / 


1-7 
2(1 + 7) 


+ ^ log M> 


(n) 


ieT 


n ,, , ID |x , {Vnlogn\PT\ +y/nniP K2 + n ^\T\^)loge 

<-log(l + |Prl) +- 2(1 + |PH) - 

y/n\PT\{\PT \ + 2) + 2y/nlogn\PT\ + 2y/nKi + 2k2 + 2n“i|Tpioge 

^ {1 + \Pt\)VT^ 


1-7 


(179) 


Dividing both sides of (179) by n and taking limit inferior as n goes to infinity, we obtain from (96) that (97) 
holds as desired. This completes the proof of Theorem 1 . 


H. Discussion on the Choices of the Quantizer’s Precision and the Parameters Used in the Wringing Technique 
in (69) 


Our choice of 6 in (69) has been optimized in the following sense. If 5 is chosen such that 5 = o 

then the second-order term on the RHS of (179) would be cj [y/n log n) (cf. (105) and (163)), which then leads to 
an upper bound on X)ieTl^S'''ill' ^ looser (larger) second-order term ui{y/n\ogn)', if 5 is chosen such that 

5 = a; , then the magnitude of the first term on the LHS of (179) would be oj {y/n log n) (cf. (101)), 

which then leads to an upper bound on X)jeT "''ll' ^ looser second-order term oj{y/n logn). Hence our 

choice of 5 = “balances” the rates of growth of the two second-order terms in (179). In this sense, our 

choice of 6 is optimal. 

We now discuss the choice of the quantizer’s precision A„ = 1/n as shown in (103). Based on this choice of 
An, we note that any choice of A in (69) satisfying = o{y/n logn) does not affect the second-order term 

of the resultant upper bound on implied by (179). In particular, the current choice A = stated 

in (69) leads to the rightmost term in (105), which contributes to the fourth constant term in (176) as well as the 
constant term on the RHS of (179). 

If the quantizer’s precision is chosen to be some other A(^, then it can be seen by inspecting (161), the upper 
bound obtained at step (a) in the chain of inequalities leading to (160), (176) and (178) that the second-order term 
of resultant upper bound on XlieTl^Sis H (max{-^nlogn, A(jn^/^}). In particular, if A'„ is chosen such 
that D(^) <A'n<0 

of A) to conclude that the second-order term of the upper bound on Yhi&T is proportional to y/n log n. As 

explained in the second remark after Lemma 3, as long as A'^ decays to zero no faster than polynomially in re, then 
\Xt\ grows at most polynomially fast in re, which will ensure that the asymptotic rates of the resultant sequence 
of subcodes obtained from the wringing step are the same as that of the original sequence of codes. However, if 
A'n decays to zero exponentially fast (i.e., A'„ = 0(2“"^) for some b > 0), then \Xt\ will grow exponentially fast 
in re and the RHS of (70) will decay exponentially rather than polynomially fast. This in turn causes the asymptotic 
rates of the resultant sequence of subcodes to decrease by a positive quantity, thus resulting in a loose first-order 
term on the RHS of the final inequality (179) (which does not match the corresponding term in the Cover-Wyner 
capacity region). Therefore, with this choice of A^, the strong converse cannot be shown. 


Vlog» 

n 


for any fixed a > 1, we can follow similar calculations (with a slight modification 
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VI. Interference Channel under Strong Interference Regime 

The capacity region of a two-source two-destination Gaussian interference channel (IC) under strong interference 
was derived hy Han and Kohayashi [24] and Sato [25]. Let Pi,P 2 he the received signal-to-noise ratios and let 
/i ,/2 he the received interference-to-noise ratios [I, Sec. 6.4]. Under the formulation of the Gaussian IC under 
strong interference, it is assumed that I 2 > Pi and h > P 2 - Under this condition, the capacity region was shown 
in [24, Th. 5.2] to he the Han-Kohayashi region 



/ 

< 5log(l + Pi), } 

7^HK-S — 

(7?i,i?2) GK+ 

R2<\^og{l +P2), 1 


V 

Ri+R2< min {2 log(l -f Pi -f p), \ log(l + P2 + h)} J 


By applying Theorem 1 to each of the decoders of the two-source two-destination Gaussian IC, we can show that 
the corresponding (ei, £ 2 )-capacity region is outer hounded as 

Cei,e2 — ^HK-s ( 181 ) 

as long as ei -)-£2 < l^ where e* characterizes the asymptotic average prohahility of destination i decoding message i 
wrongly. Since the rate pairs in T^hk s are (0, 0)-achievahle via simultaneous non-unique decoding [1, Sec. 6.4], we 
have 

^ei,e2 ~ Phk-s (182) 

as long as ei -t- £2 < 1- The strong converse (in fact, the complete second-order asymptotics) for the Gaussian IC 
under the more restrictive condition of strictly very strong interference was shown hy Le, Tan, and Motani [26]. In 
the rest of this section, we will describe the formulation of the Gaussian IC under strong interference and present 
in Section VI-B the corresponding strong converse result. 


A. Problem Formulation and Main Result 


We follow the standard setting of the Gaussian IC under strong interference as given in [24, Sec. V]. The Gaussian 
IC under strong interference consists of two sources, denoted hy si and S 2 respectively, and two destinations, denoted 
hy di and d 2 respectively. For each i G {1,2}, chooses a message Wi and transmits Xf in n time slots, and dj 
receives Yf in n time slots and declares Wi to he the transmitted Wi. The channel law in each time slot k is 


■ ■ 

= 

1 

921 

912 

1 

■ ■ 

^2,k 

+ 

1 - 1 


(183) 


where 321 and gi 2 are two real constants characterizing the channel gains of the interference links, and {(Zi fc, Z 2 ^k)}^=i 
are n independent copies of a Gaussian random vector denoted hy {Zi, Z 2 ) (Zi and Z 2 need not he independent) 
such that 


E [Zi] = E [Z 2 ] = 0 (184) 

and 

E [Zj] = E [Z|] = 1. (185) 

For each i G {1,2}, the codewords transmitted hy should satisfy the peak power constraint 

Pr {\\Xff < nPi} = 1 (186) 

for some Pi > 0. We assume that the IC is under strong interference, i.e., ^ 1 and ^ l^ which implies that 

h=9l2P2>P2 (187) 

and 

h = glPi>Pu (188) 

where Ii and I 2 characterize the interference power received at di and d 2 respectively (cf. (183)). The Gaussian 
IC is characterized hy some conditional prohahility density function qYi,Y 2 \Xi,X 2 and we define the Gaussian IC in 












25 


a similar way to a Gaussian MAC (cf. Definition 2) such that (183), (184) and (185) hold. In addition, we define 
a length-n code for the Gaussian IC as follows. 

Definition 8: An {n, M^\ m!^\ Pi, P 2 )-code for the Gaussian IC consists of the following: 

1) A message set Wi = {1,2,..., at node i for each i £ (1, 2}, where Wi is uniform on Wj. 

2) An encoding function /j : Wi — )• M” for each i £ {1,2}, where is the encoding function at node i such 
that Xf = fi{Wi) and ||/i(tUi)|p < nPi for all Wi £ Wi. 

3) A (possibly stochastic) decoding function : M” —>• Wi for each i £ {1,2}, where ipi is used hy node dj to 
estimate Wi, i.e., Wi = ^i{Yfi). 

We define an (n, Pi, P 2 , ^i, £ 2 )avg-code as follows. 

Definition 9: For an (n, Pi, P 2 )-code defined on fhe Gaussian IC, fhe average probability of decod¬ 

ing error for Wi is defined for each i £ {1, 2} as 

Pr{Wi^Wi}. (189) 

An {n, M 2 ^\ Pi, P 2 )-code wifh Pr\Wi / FFi} < £i and Pr{lX 2 W 2 } < £2 is called an 

Pi,P2,ei,e2)avg-code. 

For each ei £ [0,1) and each £2 £ [0,1), we define an (ei,£ 2 )-achievable rale pair as in Definition 4, and we 
define fhe (ei, e 2 )-capacily region, denoled by to be the set of (ei, £ 2 )-achievable rate pairs. The following 

theorem is the main result in this section. 

Theorem 2: For each £1 £ [0,1) and each £2 £ [0,1) such that £1 + £2 < 1, 

Cei,e 2 ~ P-HK-s- (190) 


B. Proof of Theorem 2 

We need the following definitions and lemma before presenting the proof of Theorem 2. The definition below 
concerning a multicast code differs from Definition 8 in the decoding functions only, but we state the whole 
definition for clarity. Essentially, a multicast code for the Gaussian IC is the same as a standard code except that 
each decoder must output estimates of both messages. 

Definition 10: An {n,M^\M!^\Pi,P 2 )-multicast code for the Gaussian IC consists of the following: 

1) A message set Wi = {1,2,..., at node i for each i £ {1,2}, where Wi is uniform on Wi. 

2) An encoding function : Wi M” for each i £ {1,2}, where is the encoding function at node i such 
that Xf = fi{Wi) and \\fi{wi)\\‘^ < nPi for all Wi £ Wi. 

3) A (possibly stochastic) decoding function (pi : R” —)• >Vi x 1 X 2 for each i £ {1,2}, where pi is used by 
node dj to estimate both Wi and W 2 such that the pair of message estimates is {Wi^,\., lX 2 ,di) — TiiXf)- 

We define an (n, Pi, P 2 , £ 1 , £ 2 )avg-inulticast code as follows. Note that the multicast code is used 

for the Gaussian IC but not a general multicast channel. 

Definition 11: For an (n, M^\m^\Pi, P 2 )-multicast code defined on the Gaussian IC, the average probability 
of decoding error at destination dj is defined for each i £ {1, 2} as 

Pr{{lXi.d,/FFi}u{FF2,d. (191) 

An (n, Pi, P 2 )-multicast code with average probability of decoding error at destination d* no larger 

than £i for each i £ {1,2} is called an [n, m[^\ M 2 ^\ Pi, P 2 , £ 1 , £ 2 )mg-code. 

The following lemma plays a crucial role in extending our strong converse result for the Gaussian MAC to 
the Gaussian IC under strong interference, because it relates the error probabilities for standard codes defined 
for the Gaussian IC in Definition 9 to the error probabilities for multicast-codes defined for the Gaussian IC in 
Definition 11. 
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Lemma 6: For each (n, Pi, ^2, ei, £2)avg-code for the Gaussian IC, there exists an 

Pi, P2, ei + £2, El + £2)avg-multicast code for the Gaussian IC. 

Proof: Suppose we are given an (n, M^\ m!^\ Pi, P2, £1, £2)avg-code whose encoding and stochastic decoding 
functions are denoted hy (/i,/2) and (991,992) respectively (cf. Definition 8). Let pwi,W 2 ,x^,x^,Y{‘,Y^,z^,z^ be the 
prohahility distribution induced hy the (n, M2"'\ Pi, P2, £1, £2)avg-code. By Definition 9, we have for each 

i G{1,2} 


Pr 


PWi , Y 7' 


{p,{Yf) / W,} < £, 


(192) 


which implies from (183) that 


(Mfiim) + gi2f2(W2) + z^) / IPi} < £i (193) 

and 

W2{g2lfl{W,) + f2{W2) + Zf) / W2} < £2 . (194) 

In the rest of the proof, we construct new stochastic decoding functions at di and d2, denoted hy (p[ and 992 respec¬ 
tively, such that (991,99'^) and (992,992) can he viewed as the stochastic decoding functions of an 
Pi, P2, £1 +£2, £1 + £2 )avg -multicast code. To this end, we first define Z^ and Z2 to he n independent copies of the 
standard normal random variable such that Zf, Zf and (X^, Z^, Zf) are independent. In addition, 

there exist G Wi and G VV’2 such that 

Prpw„w„vn {v^2(>T) + m = arg min y2{Yf) / lP2|fPi = (195) 

and 

Prpw„w„.n {ipi{Yf) ^ iyi|iy2 = W*,} = arg jnm^ {¥^1(^1") / IP1IIP2 = W2} , (196) 

which implies from (193) and (194) that 

Prpw„.J W2{g2lh{w\) + f2{W2) + Zf) 1^2} < £2 (197) 

and 

Pw„.r {¥^i(/i(lPi) + 5 i2/2(^) + Z^) ^ IPi} < £i . (198) 

Then, we define fhe sfochasfic decoders 

, Yf-hMYf)) , 

) — 7^2 5'2i/i(^r'i)-I-1-1/1-^2^2 (199) 

V 912 V gh j 

and 

p'^{Yf) ^2" - f 2 {P 2 {Yf)) ^ ^ ^ (200) 

V 521 y ffii j 

where fhe randomness properfies of fhe sfochasfic functions originafe from not only 991 and 992 but also Z” and Zf. 
Since 

g2lfl{wl) + f2{W2) + zf (201) 


and 


g2ifiiwl) + 


Yf-h{Wi) 


gi2 


^ r ■ W2^^ 


have the same distribution by (183), it follows from (197) and (199) that 


n{v,(F,”) = IVi)} 


(202) 


(203) 


< £2 ■ 


(204) 
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Combining (199) and (204), we obtain 

{MYn / or ^[(YD / IC 2 } 

= Mrn 7^ ^ 1 } + {{¥^' 1 (^ 1 ") / w^ 2 } n {<^i(yi") = iCi}} (205) 

< ei + £2 • (206) 

Following similar procedures for deriving (206), we obtain the following inequality by using (183), (198) and (200): 

Mr 2 n / or / IPi} < £1 + £2 . (207) 

Replacing the decoding functions of the (n, Mj"'\ Pi, P2, £1, £2)avg-code with ((fi,(p[) and (yj2,¥?2) 
keeping the encoding functions unchanged, we conclude from (206) and (207) that the resultant code is an 

(n, PI^\ m!^\ Pi, P2, £1 + £ 2 , ei + e2)avg-inulticast code. ■ 

We are now ready to prove the strong converse theorem for the Gaussian IC under strong interference. 

Proof of Theorem 2: Fix £1 > 0 and £2 > 0 such that 

£i+£2< 1. (208) 

As discussed at the beginning of Section VI, it follows from Theorem 5.2 in [24] that Co,o = I^hk-s where the 
quantities li and I 2 in T^hk-s are defined in (187) and (188) respectively. Since Co,o C C£i,e 2 for all non-negative 
real numbers £1 and £2 by definition, 

7^HK-s (209) 

Therefore, it suffices to prove 

C£l,e2 — ^HK-S- (210) 

To this end, fix a rate pair (i?i,i? 2 ) G Cei,£ 2 - definition, there exists a sequence of {n,Mf\Mf\Pi,P2, 
£(^”\ £ 2 ”'^)avg-codes such that 

liminf — log > Pi (211) 

n^oo n ^ 

and 

lim sup £j-"'^ < £j (212) 

n—)-oo 

for each i G {1,2}. It then following from Lemma 6 and (212) that there exists a sequence of 
Pi, P 2 , £ 1 ^"^ £ 2 "^^)avg-inulticast codes such that 

limsupEj-”^ < £1 +£2 (213) 

n^oo 

for each i G { 1 , 2 }. 

Construct a subnetwork of the Gaussian IC formed by deleting d 2 as well as the links connecting to it. By inspec¬ 
tion, the resultant subnetwork is a two-source Gaussian MAC and the sequence of (n, ,Pi,P 2 , £i"^, £r)avg- 

multicast codes for the Gaussian IC induces a sequence of (n, , Pi, P 2 , £{^ )avg-codes for the two-source 

Gaussian MAC. It then follows from (211) and (213) that {Ri,R 2 ) is (£1 -|- 62 )-achievable for the two-source 
Gaussian MAC, which implies from Theorem 1, (208) and (183) that 

Pi < ilog(l + Pi), (214) 


P2 < ^ log (1 + § 12 ^ 2 ) 
and 

Pi -|- P2 < — log (1 -|- Pi -|- gi2P2) ■ 


(215) 


( 216 ) 


28 


Similarly, if we repeat the above procedures for the other two-source Gaussian MAC resulting from deleting di 
from the Gaussian IC, we obtain 

Ri<]^\og[l +ghPi) , (217) 

< ^log(l + P2) (218) 

and 

i?l + i?2 < 2 (l + 921^1 + P'i) ■ (219) 

Combining the bounds in (214), (216), (218), (219), the capacity region in (180), and the strong interference 

conditions in (187) and (188), we have (7?i,722) £ ^hks- Consequently, the outer bound in (210) holds, and the 

theorem follows from (210) and the inner bound stated in (209). ■ 


Appendix 


Proof of Lemma 1: Suppose an (n, Pj, e)avg-code is given for some e G [0,1), and let 

e^u2- = Pi[Wx i^wx\Wx = wx] 


( 220 ) 


be the probability of decoding error given that wx is the message tuple transmitted by the sources. Then by choosing 
Wx one by one in an increasing order of we can construct a set P C Wx such that 

1 + S 


Pr|l4^i 7^ Wx I Wx = Wx} ^ 


for all Wx G V and 


\V\ > 


1 — e 

1 + e / 


nw 

i&X 


(n) 


( 221 ) 


( 222 ) 


This is essentially an expurgation argument. The bound in (221) means that there exists an (re, M^\ Px,'D,'I, 
code such that (222) holds. Fix a nonempty T PI. Define 


for each wt‘= G such that 


Since |>Vr<=| = it follows from (222) and (224) that there exists a wfc G Wr<= such that 


Pwtc — {wx e P I WT<^ = WT<^] 
\V^^f = \V\. 


(223) 

(224) 


\Pw*c \ — 


1 — e 
1 + £ 


nw 

i&T 


(n) 


or otherwise we would obtain the following chain of inequalities which would eventually contradict (222): 

(224) 


W'j'C G Wtc 


< |Wr<= 


1 — £ 
!+£/:* 


HA 


(n) 


riM 


(n) 


ieT‘= 


i&T 
1 — £ 

1 + £ 


n“, 

ieT 


(n) 


< 


1 — £ 
1+6 


ri". 

i&X 


(n) 


(225) 

(226) 

(227) 

(228) 
(229) 
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which contradicts (222). Due to (225), we can construct an (n,A4”\i^)^^^-code based on the 
Px,P,X, i^)^^^-code such that they have the same message sets, encoding functions and decoding 
function and differ in only the support set of the message tuple Wx (cf. Definition 1). In particular, the second 
statement in Definition 1 is satisfied because of the following reasons: 

1) By construction, Wx is uniform on 

2) For all wx £ we have wt<^ = by (223). 

Let A = 'Dio*,,- It remains to show that (30) and (31) hold for the M^\ Px,A,T, i^)^^-code. Recalling the 
definition of At in (11), we obtain from (223) that 


which implies from (225) that 


1^1 = |,Ar| = |Do 


1^1 = l^rl > 


1 — £ 
1 + 


nw 

i£T 


(n) 


(230) 


(231) 


Consequently, (30) follows from (230), (231) and (29). It remains to prove (31). To this end, let Wt 

denote the probability distribution induced on the Gaussian MAC by the (n,M)”\Fx,.4,r,q^)^^-code. where 


for all Wt S -4.t by Definition 1. Using (232) and (30), we obtain 


for each wt G At- 


PWt{wt) < 


1 


OieT 



(^) 


(232) 


(233) 
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